Empirically combining unnormalized NNLM and back-off N-gram for fast N-best rescoring in speech recognition
نویسندگان
چکیده
منابع مشابه
Empirically combining unnormalized NNLM and back-off N-gram for fast N-best rescoring in speech recognition
Neural network language models (NNLM) have been proved to be quite powerful for sequence modeling, including feed-forward NNLM (FNNLM), recurrent NNLM (RNNLM), etc. One main issue concerned for NNLM is the heavy computational burden of the output layer, where the output needs to be probabilistically normalized and the normalizing factors require lots of computation. How to fast rescore the N-be...
متن کاملUse of Knowledge Graph in Rescoring the N-Best List in Automatic Speech Recognition
With the evolution of neural network based methods, automatic speech recognition (ASR) field has been advanced to a level where building an application with speech interface is a reality. Inspite of these advances, building a real-time speech recogniser faces several problems such as low recognition accuracy, domain constraint and out-of-vocabulary words. The low recognition accuracy problem is...
متن کاملWeight Estimation for N-Best Rescoring
1. I N T R O D U C T I O N The N-Best rescoring paradigm involves the generation of a list of the N best sentence hypotheses by a recognition system and the subsequent rescoring of these hypotheses by other knowledge sources. The sentence hypotheses are then reranked according to a weighted linear combination of the different scores. This paradigm has the potential of achieving bet ter performa...
متن کاملRescoring n-best lists for Russian speech recognition using factored language models
In this paper, we present a research of factored language model (FLM) for rescoring N-best lists for Russian speech recognition task. As a baseline language model we used a 3gram language model. Both baseline and factored language models were trained on a text corpus collected from recent news texts on Internet sites of online newspapers; total size of the corpus is about 350 million words (2.4...
متن کاملRescoring N-Best Hypotheses for Arabic Speech Recognition: A Syntax- Mining Approach
Improving speech recognition accuracy through linguistic knowledge is a major research area in automatic speech recognition systems. In this paper, we present a syntax-mining approach to rescore N-Best hypotheses for Arabic speech recognition systems. The method depends on a machine learning tool (WEKA-3-6-5) to extract the N-Best syntactic rules of the Baseline tagged transcription corpus whic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: EURASIP Journal on Audio, Speech, and Music Processing
سال: 2014
ISSN: 1687-4722
DOI: 10.1186/1687-4722-2014-19